Rule-Based Knowledge Acquisition Method for Promoter Prediction in Human and Drosophila Species

Authors

  • Wen-Lin Huang
  • Chun-Wei Tung
  • Chyn Liaw
  • Hui-Ling Huang
  • Shinn-Ying Ho
Abstract

Rapid and reliable identification of promoter regions is important because the number of sequenced genomes is growing quickly. Various methods have been developed, but few investigate the effectiveness of sequence-based features in promoter prediction. This study proposes a knowledge acquisition method (named PromHD) based on if-then rules for promoter prediction in human and Drosophila species. PromHD uses an effective feature-mining algorithm and a reference feature set of 167 DNA sequence descriptors (DNASDs), comprising three descriptors of physicochemical properties (absorption maxima, molecular weight, and molar absorption coefficient), 128 top-ranked descriptors of 4-mer motifs, and 36 global sequence descriptors. PromHD identifies two feature subsets with 99 and 74 DNASDs and yields test accuracies of 96.4% and 97.5% in human and Drosophila species, respectively. From the 99- and 74-dimensional feature vectors, PromHD generates several if-then rules for promoter prediction by using a decision tree mechanism. The top-ranked informative rules with high certainty grades reveal that a global sequence descriptor (the length of nucleotide A at the first position of the sequence) and two physicochemical properties (absorption maxima and molecular weight) are effective in distinguishing promoters from non-promoters in human and Drosophila species, respectively.
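As a rough illustration of what a 4-mer motif descriptor might look like, the sketch below computes the relative frequency of every 4-mer in a DNA sequence. It is not the PromHD implementation: the function name `kmer_frequencies`, the overlapping-window counting, the normalization, and the example sequence are all assumptions made for illustration, and the abstract does not say how PromHD ranks or selects its 128 top-ranked 4-mer descriptors.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(seq, k=4):
    """Return the relative frequency of every possible k-mer over A, C, G, T.

    Hypothetical helper: the abstract does not specify how PromHD
    encodes its 4-mer motif descriptors.
    """
    seq = seq.upper()
    alphabet = "ACGT"
    # Slide an overlapping window of width k across the sequence.
    windows = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    # Ignore windows containing ambiguous bases such as N.
    counts = Counter(w for w in windows if set(w) <= set(alphabet))
    total = sum(counts.values())
    all_kmers = ("".join(p) for p in product(alphabet, repeat=k))
    # Every one of the 4**k possible k-mers gets an entry, including zeros.
    return {m: (counts[m] / total if total else 0.0) for m in all_kmers}


# Made-up example sequence, not taken from the study's data.
features = kmer_frequencies("TATAAAAGGCGCGTATAAAA")
```

With k = 4 this yields 256 candidate descriptors per sequence; a feature-selection step such as the one the abstract describes would then keep only a subset of them.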


Similar resources

S3PSO: Students’ Performance Prediction Based on Particle Swarm Optimization

Nowadays, new methods are required to take advantage of the rich and extensive gold mine of data given the vast content of data particularly created by educational systems. Data mining algorithms have been used in educational systems especially e-learning systems due to the broad usage of these systems. Providing a model to predict final student results in educational course is a reason for usi...


A Knowledge Management Model for the President's Vice-Presidencies for Strategic Supervision and for Management and Human Capital Development

This study tries to pinpoint the knowledge elements and present an appropriate knowledge management (KM) application model for the ‘Vice-Presidency for Strategic Planning and Supervision’ and the ‘Vice-Presidency for Management and Human Capital Development’ (the staff body of the former Management and Planning Organization) in Iran. The dependent and independent variables of the research were defined acc...


Stock price prediction using the Chaid rule-based algorithm and particle swarm optimization (pso)

Stock prices in each industry are one of the major issues in the stock market. Given the increasing number of shareholders in the stock market and their attention to the price of different stocks in transactions, the prediction of the stock price trend has become significant. Many people use the share price movement process when comparing different stocks while investing, and also want to pred...


A Fugacity Approach for Prediction of Phase Equilibria of Methane Clathrate Hydrate in Structure H

In this communication, a thermodynamic model is presented to predict the dissociation conditions of structure H (sH) clathrate hydrates with methane as help gas. This approach is an extension of the Klauda and Sandler fugacity model (2000) for prediction of phase boundaries of sI and sII clathrate hydrates. The phase behavior of the water and hydrocarbon system is modeled using the Peng-Robinso...


A Link Prediction Method Based on Learning Automata in Social Networks

Nowadays, online social networks are considered one of the most important emerging phenomena of human societies. In these networks, link prediction relies on existing knowledge of the interactions between network actors to estimate the probability that a new relationship will be created in the future. A wide range of applications can be found for link prediction such as electro...



Journal title:

Volume 2014, Issue

Pages -

Publication date: 2014